Abstract: The angular power spectrum of a stationary random field on the sphere is estimated
from the needlet coefficients of a single realization, observed with increasingly fine
resolution. The estimator we consider is similar to the one recently used in practice by (Fay
et al. 2008) to estimate the power spectrum of the Cosmic Microwave Background. The
consistency of the estimator, in the asymptotics of high frequencies, is proved for a model
with a stationary Gaussian field corrupted by heteroscedastic noise and missing data.

1. Introduction

In many application domains (geophysics, cosmology, hydrodynamics, computer vision, etc.), data
are defined on the sphere. If the data fit the model of a stationary stochastic field, their
second-order characteristics, summarized by the angular power spectrum, are of great importance:
in the case of a Gaussian stationary process, they contain all the distributional information. The
ordinary spherical harmonic transform (SHT), the equivalent of the Fourier series on the circle,
provides a fast and efficient method for spectral estimation in the idealized case of a fully and
perfectly observed sphere.

However, the data are rarely available on the whole sphere, and they are often observed under
non-stationary contaminants. This is the case for the cosmic microwave background (CMB), which is
a major motivation for this work. For those reasons, during the past decade, localized analysis for 
spherical data has motivated many developments; see [TH [TOl HOI IS] and the references therein. 

Wavelets provide a powerful framework for dealing with non-stationarities. A recent construction
of wavelet frames by [16, 17] has proved efficient for analyzing stationary spherical processes,
thanks to their good localization property. In the time series literature, wavelets are used for
spectral estimation in both semi-parametric (see e.g. [Ej) and non-parametric [3] contexts. Our
observation model, in addition to being spherical, has the particularity of presenting quite general
non-stationarity (in the structure of the noise), and we failed to find any reference on the subject,
even for processes living on Euclidean spaces.

In this paper, we establish the consistency, in the high-frequency asymptotics, of a spectral
estimator built on needlet coefficients.

The paper is organized as follows. In Section 2 we present the model, including assumptions on
the way the process is sampled. In Section 3 we define the needlet spectral estimators and state
the consistency results, which hold under realistic conditions. The finite-sample behavior of the
estimator is explored by numerical simulations (Section 4). Our conclusion is given in Section 5
and proofs are postponed to Section 6.

2. Model and settings 

2.1. Gaussian stationary spherical fields

Consider the unit sphere $\mathbb S$ in $\mathbb R^3$ with generic element $\xi$. The geodesic distance is given by
$d(\xi,\xi') = \arccos(\xi\cdot\xi')$ where $\xi\cdot\xi'$ denotes the usual dot product between $\xi$ and $\xi'$
(considered as vectors in $\mathbb R^3$). The uniform measure $d\xi$ is the unique positive measure on $\mathbb S$
which is invariant by rotation, with total mass $4\pi$. Let $\mathbb H = L^2(\mathbb S, d\xi)$ be the Hilbert space of
complex-valued square integrable functions. We have the following decomposition:
$\mathbb H = \bigoplus_{\ell\ge 0}\mathbb H_\ell$ where $\mathbb H_\ell$ is the vector space of spherical harmonics of degree $\ell$,
i.e. restrictions to the sphere of homogeneous polynomials of degree $\ell$ in $\mathbb R^3$ which are harmonic
(or, equivalently, the restrictions of which are eigenvectors of the spherical Laplacian with
eigenvalues $\ell(\ell+1)$). The usual spherical harmonics $Y_{\ell,m}$, $-\ell\le m\le\ell$, constitute an
orthonormal basis of $\mathbb H_\ell$. Therefore, the set of all spherical harmonics $Y_{\ell,m}$, $\ell\ge 0$,
$-\ell\le m\le\ell$, is an orthonormal basis of $\mathbb H$.

In this paper, we shall be concerned with a zero-mean, mean square continuous and real-valued
random field $X(\xi)$. We shall assume that $X$ is second-order stationary, that is
$E[X(\rho\xi)X(\rho\xi')] = E[X(\xi)X(\xi')]$ for all $\rho\in SO(3)$. Then the spherical harmonics coefficients of
$X$, $a_{\ell,m} = \langle X, Y_{\ell,m}\rangle_{\mathbb H}$, are square integrable random variables which verify
$E[a_{\ell,m}a^*_{\ell',m'}] = \delta_{\ell,\ell'}\delta_{m,m'}C_\ell$ for $m,m'\ge 0$ and $a_{\ell,-m} = a^*_{\ell,m}$.
The inverse spherical harmonics transform reads: $X(\xi) = \sum_{\ell\ge 0}\sum_{m=-\ell}^{\ell} a_{\ell,m}Y_{\ell,m}(\xi)$.
The last equality holds in the sense that
$E\left[\int_{\mathbb S}\left(X(\xi) - \sum_{\ell=0}^{L}\sum_{m=-\ell}^{\ell} a_{\ell,m}Y_{\ell,m}(\xi)\right)^2 d\xi\right]\to 0$ as $L\to\infty$.
The sequence $(C_\ell)_{\ell\ge 0}$ is called the (angular) power spectrum of $X$. Let $L_\ell$ denote the Legendre
polynomial of degree $\ell$ normalized by $L_\ell(1) = \frac{2\ell+1}{4\pi}$. The angular power spectrum is linked to
the angular correlation of $X$ by the relation $\sum_{\ell\ge 0} C_\ell L_\ell(\cos\theta) = E[X(\xi)X(\xi')]$ for all pairs
of points such that $d(\xi,\xi') = \theta$. The square integrability of $X$ is equivalent to the condition
$\sum_{\ell\ge 0}(2\ell+1)C_\ell < \infty$.
We shall also assume that $X$ is Gaussian. This additional assumption is known to be true if
and only if the coefficients $a_{\ell,m}$, $\ell\ge 0$, $m\ge 0$, are independent (see [2] for the "only if" part).
As mentioned in the Introduction, the finite-dimensional distributions of a Gaussian stationary
field are entirely determined by the second-order characteristics, that is by the angular power
spectrum of the field. For instance, second-order stationarity is equivalent, under the Gaussian
assumption, to strict stationarity, i.e. for all $\rho\in SO(3)$ and $\xi_1,\dots,\xi_n\in\mathbb S$ the two vectors
$(X(\rho\xi_1),\dots,X(\rho\xi_n))$ and $(X(\xi_1),\dots,X(\xi_n))$ have the same distribution.
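
The relation between the power spectrum and the angular covariance can be explored numerically. The sketch below is only an illustration: it assumes the normalization $L_\ell = \frac{2\ell+1}{4\pi}P_\ell$ (with $P_\ell$ the classical Legendre polynomial, $P_\ell(1)=1$), and the power-law spectrum and the function names are hypothetical choices, not taken from the paper.

```python
import math

def legendre_values(lmax, x):
    """Return [P_0(x), ..., P_lmax(x)] using the three-term recurrence."""
    values = [1.0, x]
    for l in range(1, lmax):
        values.append(((2 * l + 1) * x * values[l] - l * values[l - 1]) / (l + 1))
    return values[: lmax + 1]

def angular_covariance(spectrum, theta):
    """Evaluate sum_l C_l L_l(cos theta) with L_l = (2l+1)/(4 pi) P_l."""
    p = legendre_values(len(spectrum) - 1, math.cos(theta))
    return sum((2 * l + 1) / (4 * math.pi) * c * p[l] for l, c in enumerate(spectrum))

# Hypothetical spectrum C_l = (1 + l)^(-3), truncated at l = 200 (an assumption).
spectrum = [(1.0 + l) ** -3 for l in range(201)]
variance = angular_covariance(spectrum, 0.0)      # covariance at zero distance
covariance = angular_covariance(spectrum, 1.0)    # covariance at one radian
```

At zero distance the sum reduces to $(4\pi)^{-1}\sum_\ell(2\ell+1)C_\ell$, which is why the summability condition above controls the variance of the field.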

2.2. Sampling on the sphere 

In any real-life situation, only discretized versions of $X$ are available, and consequently spherical
harmonic coefficients are exactly computable only if $X$ is $L$-band-limited, that is $a_{\ell,m} = 0$ for all
$\ell > L$, for some $L$ which depends essentially on the number of observed points. The discretization
of the sphere and achievement of cubature formulas for geodetic functions is a non-trivial task.
During the last decade it was shown ([HIITT]) that there exists a constant $c_0 > 0$ such that for all
$L\in\mathbb N^*$ there exists a set $(\xi_k,\lambda_k)_{k\in\{1,\dots,N\}}\in(\mathbb S\times\mathbb R_+^*)^N$ of cubature points and weights
(referred to as a pixelization of order $L$) with the following properties.

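For all $f\in\bigoplus_{\ell=0}^{L}\mathbb H_\ell$, $\int_{\mathbb S} f(\xi)\,d\xi = \sum_{k=1}^{N}\lambda_k f(\xi_k)$ (cubature formula). (1a)

$c_0^{-1}L^2\le N\le c_0L^2$. (1b)

$c_0^{-1}L^{-2}\le\min_{1\le k\le N}\lambda_k\le\max_{1\le k\le N}\lambda_k\le c_0L^{-2}$. (1c)

$c_0^{-1}L^{-1}\le\sup_{\xi\in\mathbb S} d(\xi,\{\xi_k\}_{k\in\{1,\dots,N\}})\le c_0L^{-1}$. (1d)

$c_0^{-1}L^{-1}\le\min_{1\le k<k'\le N} d(\xi_k,\xi_{k'})\le c_0L^{-1}$. (1e)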

The following two lemmas derive straightforwardly from these pixelization properties. The first
one is proved in [1] and the second one follows from a simple covering argument.

Lemma 1. There exists a constant $c > 0$ such that for all $L\in\mathbb N^*$, $\xi\in\mathbb S$ and for all pixelization
of order $L$ we have

$\sum_{k=1}^{N}\frac{1}{(1+L\,d(\xi,\xi_k))^{M}}\le c.$

Lemma 2. There exists a constant $c > 0$ such that for all $L\in\mathbb N^*$, $\xi\in\mathbb S$ and $\delta\ge c_0L^{-1}$ and for
all pixelization of order $L$ we have

$c^{-1}\delta^2L^2\le\mathrm{Card}\{k\in\{1,\dots,N\} : d(\xi,\xi_k)\le\delta\}\le c\,\delta^2L^2.$
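
To make the cubature property (1a) concrete, here is a minimal sketch of one classical construction: a product of a Gauss-Legendre rule in $z=\cos\theta$ and an equispaced rule in longitude. This is an assumption made for illustration only; it is not the pixelization of the cited works, and although it satisfies (1a) and (1b), its points cluster near the poles so it is not meant to satisfy (1c) to (1e) uniformly. All function names are hypothetical.

```python
import math

def gauss_legendre(n):
    """Nodes and weights of the n-point Gauss-Legendre rule on [-1, 1] (Newton iteration)."""
    nodes, weights = [], []
    for i in range(1, n + 1):
        x = math.cos(math.pi * (i - 0.25) / (n + 0.5))  # classical initial guess
        for _ in range(100):
            p_prev, p = 1.0, x                          # P_0(x), P_1(x)
            for k in range(2, n + 1):
                p_prev, p = p, ((2 * k - 1) * x * p - (k - 1) * p_prev) / k
            dp = n * (x * p - p_prev) / (x * x - 1.0)   # derivative P_n'(x)
            step = p / dp
            x -= step
            if abs(step) < 1e-15:
                break
        nodes.append(x)
        weights.append(2.0 / ((1.0 - x * x) * dp * dp))
    return nodes, weights

def pixelization(L):
    """Points and weights integrating exactly all spherical polynomials of degree <= L."""
    n_lat, n_lon = L // 2 + 1, L + 1
    z_nodes, z_weights = gauss_legendre(n_lat)
    points, weights = [], []
    for z, wz in zip(z_nodes, z_weights):
        s = math.sqrt(1.0 - z * z)
        for q in range(n_lon):
            phi = 2.0 * math.pi * q / n_lon
            points.append((s * math.cos(phi), s * math.sin(phi), z))
            weights.append(wz * 2.0 * math.pi / n_lon)
    return points, weights

def integrate(f, points, weights):
    """Cubature sum: sum_k lambda_k f(xi_k)."""
    return sum(w * f(*p) for p, w in zip(points, weights))

points, weights = pixelization(4)
total_mass = sum(weights)                                   # should equal 4*pi
z4 = integrate(lambda x, y, z: z ** 4, points, weights)     # exact value 4*pi/5
```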

2.3. Observation model 

We are now in position to give a description of our statistical model. Assume that we observe a
noisy and sampled version of low-passed $X$ at successive scales $j$, with some missing (or attenuated)
data. More precisely, for every $j\in\mathbb N$, given some $L_j\in\mathbb N^*$,

• let $(\xi_{j,k},\lambda_{j,k})_{k\in\{1,\dots,N_j\}}$ be a pixelization of order $4L_j$;

• let $W_j : k\in\{1,\dots,N_j\}\mapsto W_{j,k}\in[0,1]$ and $\sigma_j : k\in\{1,\dots,N_j\}\mapsto\sigma_{j,k}\in\mathbb R_+$ be
deterministic, known functions;

• and let $B_j : \ell\in\mathbb N\mapsto B_{j,\ell}\in\mathbb R$ be such that $B_{j,\ell} = 1$ if $\ell\le L_j$ and $B_{j,\ell} = 0$ if $\ell > 2L_j$.

We observe

$Y_{j,k} = W_{j,k}\left[X_j(\xi_{j,k}) + Z_{j,k}\right]$, $j\in\mathbb N$, $k\in\{1,\dots,N_j\}$ (2)
where $Z_{j,k} = \sigma_{j,k}U_{j,k}$ and $U_{j,k}$, $j\in\mathbb N$, $k\in\{1,\dots,N_j\}$, is a triangular array of standard and
independent Gaussian random variables, independent of the process $X$. The process $X_j$ is
defined by

$X_j = \sum_{\ell\ge 0}\sum_{m=-\ell}^{\ell} B_{j,\ell}\,a_{\ell,m}Y_{\ell,m}$, $a_{\ell,m} = \langle X, Y_{\ell,m}\rangle_{\mathbb H}$. (3)

In any CMB experiment, some smoothing is induced by the instrumental beam. Eq. (3) is an idealized
version of this low-pass operation.

Without loss of generality, $j\mapsto L_j$ is supposed non-decreasing. In the following, we call $W_j$ the
mask. The particular case of $W_j$ taking its values in $\{0, 1\}$ corresponds to missing data.

In other words, a single realization of $X$ is considered, but independent and noisy measurements with
an increasing spatial resolution are available. This corresponds, for instance, to the observation
model of the CMB. The latter is modeled by astrophysicists as the single realization of a stationary
Gaussian process. Its observation is achieved by more and more precise instruments, each involving
its own observation noise, sky coverage and instrumental beam. Full-sky maps of moderate
resolution (e.g. maps provided by the WMAP collaboration [4]) and observations of small and
clean patches of the sky at very high resolution (e.g. maps from the ACBAR experiment [15]) are
available simultaneously. Cosmologists aggregate information from those maps to give a wide-band
estimate of the angular power spectrum.

2.4. Needlets and statistical properties of needlet coefficients

2.4.1. General framework

The needlets are second-generation wavelet frames which were introduced by [17]. Let us recall
below their definition and first properties, the proofs of which are either referred to existing
literature or postponed to Section 6.

Start from the fact that the orthogonal projection on $\mathbb H_\ell$ has a kernel involving Legendre
polynomials, namely

$\Pi_{\mathbb H_\ell}f(\xi) = \int_{\mathbb S} L_\ell(\xi\cdot\xi')f(\xi')\,d\xi'.$

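Instead of considering single frequencies $\ell$, we shall combine them within frequency bands. For
this purpose, define a sequence of functions $b_j : \ell\in\mathbb N\mapsto b_{j,\ell}\in\mathbb R$, $j\in\mathbb N$, called (frequency)
window functions. The window $b_j$ is supposed to be supported in $[0, L_j]$ for some $L_j\in\mathbb N$.
The kernels $\Psi_j : (\xi,\xi')\mapsto\sum_{\ell\ge 0} b_{j,\ell}L_\ell(\xi\cdot\xi')$ and $\Lambda_j : (\xi,\xi')\mapsto\sum_{\ell\ge 0}(b_{j,\ell})^2L_\ell(\xi\cdot\xi')$
have the two following obvious properties. First, for all $f\in\mathbb H$,
$f(\xi) = \sum_{j\in\mathbb N}\int_{\mathbb S}\Lambda_j(\xi,\xi')f(\xi')\,d\xi'$. Second,

$\Lambda_j(\xi,\xi') = \int_{\mathbb S}\Psi_j(\xi,\xi'')\Psi_j(\xi'',\xi')\,d\xi''.$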

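The discretization of the above kernels leads to the following spherical functions called needlets.
For each scale $j\in\mathbb N$, given a pixelization $(\xi_{j,k},\lambda_{j,k})_{k\in\{1,\dots,N_j\}}$ of order at least $2L_j$, define

$\psi_{j,k}(\xi)\stackrel{\mathrm{def}}{=}\sqrt{\lambda_{j,k}}\sum_{\ell\ge 0} b_{j,\ell}L_\ell(\xi\cdot\xi_{j,k}).$

The needlets $\psi_{j,k}$, $j\in\mathbb N$, $k\in\{1,\dots,N_j\}$, constitute a tight frame of $\mathbb H$ [T71 IH] if for all
$\ell\in\mathbb N^*$, $\sum_{j\in\mathbb N}(b_{j,\ell})^2 = 1$. For any (possibly random) function $f$ in $\mathbb H$, the coefficients
$\langle f,\psi_{j,k}\rangle_{\mathbb H}$ are renormalized for the sake of notational simplicity: we shall handle the needlet
coefficients

$\gamma_{j,k}\stackrel{\mathrm{def}}{=}(\lambda_{j,k})^{-1/2}\langle f,\psi_{j,k}\rangle_{\mathbb H}.$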

If $f$ is $L_J$-band-limited, one can compute practically the coefficients $\gamma_{j,k}$, $j\le J$, in the spherical
harmonics domain, from the values of $f$ on the cubature points, as made explicit by the following
diagram.

$(f(\xi_k))_{1\le k\le N}\ \overset{\mathrm{SHT}}{\longrightarrow}\ (a_{\ell,m})_{\ell\le L_J}\ \Longrightarrow\ (b_{j,\ell}\,a_{\ell,m})_{\ell\le L_j}\ \overset{\mathrm{SHT}^{-1}}{\Longrightarrow}\ (\gamma_{j,k})_{1\le k\le N_j}.$ (4)

The initial pixelization $(\xi_k,\lambda_k)_{1\le k\le N}$ must be of order at least $2L_J$. SHT denotes the spherical
harmonics transform, computed from the samples of $f$ and of the $Y_{\ell,m}$'s thanks to (1a). Double
arrows denote $J$ operations.

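Since the needlet coefficients at a given scale $j$ depend only on a finite number of values of the
function $f$, it is possible to generalize this notion to an arbitrary (possibly random) finite sequence
$(f_k)_{1\le k\le N}\in\mathbb R^N$. The needlet coefficients of such a sequence are the quantities

$\sum_{p=1}^{N}\lambda_{j,p}f_p\sum_{\ell\ge 0}\sum_{m=-\ell}^{\ell} b_{j,\ell}\,Y^*_{\ell,m}(\xi_{j,p})Y_{\ell,m}(\xi_{j,k}) = (\lambda_{j,k})^{-1/2}\sum_{p=1}^{N}\lambda_{j,p}f_p\,\psi_{j,k}(\xi_{j,p}).$

If $f_k = f(\xi_{j,k})$ for some $f\in\bigoplus_{\ell=0}^{L_j}\mathbb H_\ell$ and the $\xi_{j,k}$ are the points of a pixelization of order at
least $2L_j$, then the above expressions are equal to $\gamma_{j,k}$.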

Let us give the first properties of the needlet coefficients of a random field at a given scale
$j\in\mathbb N$. Let $X$ be a stationary field, $X_j$ like in Eq. (3) with $B_{j,\ell} = 1$ if $\ell\le L_j$ and $B_{j,\ell} = 0$ if
$\ell > 2L_j$, and $(\xi_{j,k},\lambda_{j,k})_{1\le k\le N_j}$ a pixelization of order $4L_j$. The needlet coefficients of $X$ are
denoted $\eta_{j,k}$. They are also the needlet coefficients of $X_j$ since $X$ and $X_j$ have the same spherical
harmonics coefficients up to the frequency $\ell = L_j$. In the presence of an additive noise $Z_{j,k}$ in
the observation of $X(\xi_{j,k})$, the "observed" needlet coefficients computed by (4) from $X + Z$ write
$\eta_{j,k} + \zeta_{j,k}$, where the coefficients $\zeta_{j,k}$ are the needlet coefficients of $Z$. The next results provide
the covariance structure of those coefficients at scale $j$. In our model, $X$ and $Z$ are supposed
Gaussian. In this case, their needlet coefficients are Gaussian too.

Proposition 3. Denote $(C_\ell)_{\ell\ge 0}$ the power spectrum of $X$. Its needlet coefficients $\eta_{j,k}$ are centered,
with covariances given by

$\mathrm{Cov}[\eta_{j,k},\eta_{j,k'}] = \sum_{\ell\ge 0}(b_{j,\ell})^2\,C_\ell\,L_\ell(\xi_{j,k}\cdot\xi_{j,k'}).$



Proposition 4. Assume $Z$ of the form $Z_{j,k} = \sigma_{j,k}U_{j,k}$ where the $U_{j,k}$ are uncorrelated, centered
and unit variance random variables. The needlet coefficients $\zeta_{j,k}$ of $Z$ are centered, with covariances
given by

$\mathrm{Cov}[\zeta_{j,k},\zeta_{j,k'}] = \frac{1}{\sqrt{\lambda_{j,k}\lambda_{j,k'}}}\sum_{p=1}^{N_j}(\lambda_{j,p}\sigma_{j,p})^2\,\psi_{j,k}(\xi_{j,p})\,\psi_{j,k'}(\xi_{j,p}).$

2.4.2. B-adic needlets

In this paper we shall fix some constant $B > 1$ and consider $B$-adic window functions.

Assumption 1. There exist $M\ge 3$ and an $M$-differentiable real function $a$ supported in $[-B, B]$
and identically equal to 1 on $[-B^{-1}, B^{-1}]$ such that

$b_{j,\ell} = b(B^{-j}\ell)$

where $b(\cdot) = a(\cdot/B) - a(\cdot)$.

For such window functions, $L_j = B^{j+1}$. These spectral windows are not as general as those
used by [5]. Indeed, it has been shown by [9] that one can take advantage of the relaxation of the
$B$-adic scheme originally proposed in the definition of needlets to optimize their non-asymptotic
localization properties. In the following, since we are concerned with asymptotic properties, we
will make use of the $B$-adic structure of Assumptions 1 and 2 so that the spatial localization
property of the needlet takes the convenient form of the next proposition.
Proposition 5 ([17]). There exists a constant $c > 0$ which depends only on the function $b$ such
that for all $j\in\mathbb N$, $k\in\{1,\dots,N_j\}$ and $\xi\in\mathbb S$

$|\psi_{j,k}(\xi)|\le\frac{c\,B^{j}}{(1+B^{j}d(\xi,\xi_{j,k}))^{M}}.$

The stochastic counterpart of this analytical result is that the needlet coefficients of a stationary
field are asymptotically uncorrelated as $j\to\infty$, except for points at a distance of order $B^{-j}$ or
less. For this purpose and throughout this article, we make on the power spectrum of $X$ the same
regularity assumption as in [U [121 HH US].

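Assumption 2. There exist $\alpha > 2$ and a sequence of functions $g_j : [B^{-1}, B]\to\mathbb R$, $j\in\mathbb N$, such
that

$C_\ell = \ell^{-\alpha}\,g_j(B^{-j}\ell)$

for every $\ell\in[B^{j-1}, B^{j+1}]$. Moreover, there exist positive numbers $c_0,\dots,c_M$ such that for all
$j\in\mathbb N$, $c_0^{-1}\le g_j\le c_0$ and for all $r\le M$, $\sup_{B^{-1}\le u\le B}\left|\frac{d^r}{du^r}g_j(u)\right|\le c_r$.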

Proposition 6 ([1]). Let $X$ be stationary with a power spectrum satisfying Assumption 2 and
$\eta_{j,k}$ its needlet coefficients. Then there exists a constant $c > 0$ such that for all $j\in\mathbb N$ and
$k,k'\in\{1,\dots,N_j\}$

$|\mathrm{Cor}[\eta_{j,k},\eta_{j,k'}]|\le\frac{c}{(1+B^{j}d(\xi_{j,k},\xi_{j,k'}))^{M}}.$

Remark. A generalization of this $B$-adic framework, in a different direction to the one of [9],
can be found in f^ M^A \11^ . The authors do not suppose that the function $a$ (or $b$) is compactly
supported and obtain localization and asymptotic uncorrelation results similar to Propositions 5
and 6.

3. Estimation results 

In this section, we present a new procedure for the estimation of the angular power spectrum of $X$
in the model of Eq. (2), based on the needlet coefficients of $Y$. The properties of needlets described
in Section 2.4 make it possible to take the local signal-to-noise ratio into account when estimating
the (nevertheless global) spectrum. This spatial accuracy comes at the cost of a lower frequential
precision: not every value of $C_\ell$ will be estimated, but only the mean values in the bands defined
by the windows $b_{j,\ell}$.

3.1. Parameters and estimators 

We shall provide below estimators for the parameters

$C^{(j)}\stackrel{\mathrm{def}}{=}(4\pi)^{-1}\sum_{\ell\ge 0}(2\ell+1)(b_{j,\ell})^2\,C_\ell$

which are smoothed versions of the power spectrum of $X$ at successive scales. A comparison of a
Riemann sum to an integral shows that, under Assumption 2, $c_1B^{j(2-\alpha)}\le C^{(j)}\le c_2B^{j(2-\alpha)}$
for some $c_1, c_2 > 0$.
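
The scaling $C^{(j)}\asymp B^{j(2-\alpha)}$ can be checked numerically on a toy case. The sketch below assumes a pure power law $C_\ell=\ell^{-\alpha}$ and uses a hypothetical smooth bump supported in $[B^{-1},B]$ as window; this bump is an assumption made for brevity and is not the window design of the paper. Under these assumptions the ratio of two successive band parameters should approach $B^{2-\alpha}$.

```python
import math

def bump_window(x, B):
    """Hypothetical smooth window supported in [1/B, B] (illustration only)."""
    if x <= 1.0 / B or x >= B:
        return 0.0
    return math.cos(0.5 * math.pi * math.log(x, B)) ** 2

def band_parameter(j, alpha, B=2.0):
    """C^(j) = (4 pi)^-1 sum_l (2l+1) b_{j,l}^2 C_l for the power law C_l = l^-alpha."""
    lmax = int(B ** (j + 1)) + 1
    total = 0.0
    for ell in range(1, lmax + 1):
        b = bump_window(ell / B ** j, B)
        total += (2 * ell + 1) * b * b * ell ** -alpha
    return total / (4.0 * math.pi)

alpha = 3.0
# Successive ratios C^(j+1) / C^(j); with B = 2 and alpha = 3 they should be close to 1/2.
ratios = [band_parameter(j + 1, alpha) / band_parameter(j, alpha) for j in range(6, 10)]
```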

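A sequence of estimators $\hat C^{(j)}$ is said to be consistent (in the high-frequency asymptotics) if
$\mathcal E(\hat C^{(j)})\to 0$ as $j\to\infty$, where

$\mathcal E(\hat C^{(j)})\stackrel{\mathrm{def}}{=}E\left[\left(\frac{\hat C^{(j)}-C^{(j)}}{C^{(j)}}\right)^2\right].$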
The Needlet Spectral Estimators (NSE) are defined by

$\hat C^{(j)}\stackrel{\mathrm{def}}{=}\sum_{k=1}^{N_j} w_{j,k}\left\{(\gamma_{j,k})^2-(n_{j,k})^2\right\}$ (5)

with some weights such that $\sum_{k=1}^{N_j} w_{j,k} = 1$ and where

$n_{j,k}\stackrel{\mathrm{def}}{=}\frac{1}{\sqrt{\lambda_{j,k}}}\left\{\sum_{p=1}^{N_j}\left(\lambda_{j,p}W_{j,p}\sigma_{j,p}\,\psi_{j,k}(\xi_{j,p})\right)^2\right\}^{1/2}$ (6)


is the standard deviation of the needlet coefficients of the masked noise (it follows from
Proposition 4 with $k = k'$ and $W_{j,k}\sigma_{j,k}$ in place of $\sigma_{j,k}$). The $\gamma_{j,k}$'s are the observed needlet
coefficients of $Y$ and the $n_{j,k}$ are defined in Eq. (6). The weights should ideally be designed
according to the local noise level and the local effect of the mask. For this purpose, given a
sequence $(t_j)_{j\in\mathbb N}$ in $\mathbb R_+$, let us define the set of kept coefficients

$\mathcal K_j\stackrel{\mathrm{def}}{=}\left\{k\in\{1,\dots,N_j\} : \sum_{p=1}^{N_j}\lambda_{j,p}(1-W_{j,p})^2\,\psi_{j,k}^2(\xi_{j,p})\le t_j\right\}.$


As the second-order characteristics of the noise are known, its effect on the square of the needlet
coefficients is completely subtracted in (5). Thus, the bias of the NSE is caused only by the mask.

Proposition 7. For any choice of weights such that $w_{j,k} = 0$ for $k\notin\mathcal K_j$, there exists a constant
$c$ such that

J r-. ^ < c'B'^H,. 

CO) - ^ 

In particular, if $W_j\equiv 1$ (no mask), then for any choice of weights, $\hat C^{(j)}$ is unbiased.
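
The unbiasedness statement can be checked directly from Propositions 3 and 4. The lines below are a sketch of that computation, written under the assumptions that $W_j\equiv 1$, that the signal and noise coefficients $\eta_{j,k}$ and $\zeta_{j,k}$ are independent (as in the observation model of Section 2.3), and that the Legendre polynomials are normalized so that $L_\ell(1)=(2\ell+1)/(4\pi)$.

```latex
\begin{align*}
E\big[(\gamma_{j,k})^2\big]
  &= \mathrm{Var}(\eta_{j,k}) + \mathrm{Var}(\zeta_{j,k})
     && \text{(independence, zero means)}\\
  &= \sum_{\ell\ge 0} (b_{j,\ell})^2\, C_\ell\, L_\ell(1) + (n_{j,k})^2
     && \text{(Propositions 3 and 4 with } k=k')\\
  &= (4\pi)^{-1}\sum_{\ell\ge 0} (2\ell+1)(b_{j,\ell})^2\, C_\ell + (n_{j,k})^2
   = C^{(j)} + (n_{j,k})^2 ,\\
E\big[\hat C^{(j)}\big]
  &= \sum_{k=1}^{N_j} w_{j,k}\Big\{E\big[(\gamma_{j,k})^2\big] - (n_{j,k})^2\Big\}
   = C^{(j)}\sum_{k=1}^{N_j} w_{j,k} = C^{(j)} .
\end{align*}
```

The last equality only uses the normalization $\sum_k w_{j,k}=1$, which is why the conclusion holds for any choice of weights.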

This result is completed by the following one about the variance of $\hat C^{(j)}$ when there is no noise.

Proposition 8. Suppose that $\sigma_{j,k} = 0$ for all $k\in\{1,\dots,N_j\}$ and that $w_{j,k} = 0$ for $k\notin\mathcal K_j$. Then
there exists $c > 0$ such that

Var (g(-'-)) ^ J ^ \^j,kw,,k'\ ^N^^ f N,t, 



C2 



.,tli(i + 25^'i(0,.,C..'))"' ^''' Vco) 



If the needlet coefficients at a scale $j$ were independent, centered Gaussian variables with
unknown variance $C^{(j)}$, observed with independent centered heteroscedastic Gaussian additive
errors of variance $(n_{j,k})^2$, then the maximum of the likelihood in $C^{(j)}$ would be reached for the value
$\hat C^{(j)}$ given by (5) with weights $w_{j,k}\propto(C^{(j)}+(n_{j,k})^2)^{-2}$ (see e.g. |S]). In reality, the uncorrelation
of the needlet coefficients holds only approximately, in the sense of Proposition 6. However, if some
previous, possibly rough, estimates for $C^{(j)}$ are available, say $\bar C^{(j)}$, the following weights are still
meaningful:

$w_{j,k} = \frac{1}{S}\left(\bar C^{(j)}+(n_{j,k})^2\right)^{-2}\mathbb 1_{k\in\mathcal K_j}$ (7)

where $S = \sum_{k=1}^{N_j}\left(\bar C^{(j)}+(n_{j,k})^2\right)^{-2}\mathbb 1_{k\in\mathcal K_j}$.
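
The estimator (5) with the weights (7) can be sketched in a few lines once the observed coefficients and the noise standard deviations are given. The function names, the toy numbers and the boolean list encoding the set of kept coefficients are hypothetical; computing the needlet coefficients themselves (which requires a spherical harmonic transform) is deliberately left out of this sketch.

```python
import math

def nse_weights(prior_c, noise_std, kept):
    """Weights of Eq. (7): proportional to (prior_c + n_k^2)^(-2) on kept indices, zero elsewhere."""
    raw = [(prior_c + n * n) ** -2 if keep else 0.0 for n, keep in zip(noise_std, kept)]
    total = sum(raw)  # the normalizing constant S
    return [r / total for r in raw]

def needlet_spectral_estimate(gamma, noise_std, weights):
    """Eq. (5): weighted average of the debiased squares gamma_k^2 - n_k^2."""
    return sum(w * (g * g - n * n) for g, n, w in zip(gamma, noise_std, weights))

# Toy inputs (assumptions): four coefficients, the last one discarded by the mask criterion.
noise_std = [0.1, 0.5, 2.0, 1.0]
kept = [True, True, True, False]
true_c = 0.8
# Idealized coefficients whose squares equal their expectation C + n_k^2.
gamma = [math.sqrt(true_c + n * n) for n in noise_std]

weights = nse_weights(prior_c=1.0, noise_std=noise_std, kept=kept)
estimate = needlet_spectral_estimate(gamma, noise_std, weights)
```

In this idealized input every debiased square equals the target value, so the estimate does not depend on the rough prior used in the weights; with real coefficients the prior only affects the variance of the estimate.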

3.2. Consistency of the estimators 

Suppose that Assumptions 1 and 2 are satisfied. Theorems 1 and 2 below provide conditions for the
consistency of the needlet spectral estimators (5). Theorem 1 deals with generic weights whereas
in Theorem 2 we consider weights of the form (7).
Theorem 1. Assume that
i) the weights $w_{j,k}$ are such that $w_{j,k} = 0$ for $k\notin\mathcal K_j$ and that moreover






E — 



(1 + I3^d(4,,4fe0) '-' 
and 







ii) the noise verifies



p=i 
iii) the thresholds $t_j$ defining the set of kept coefficients are chosen such that

$t_j = o(B^{-\alpha j}).$

Then the sequence $\hat C^{(j)}$ is consistent.

Remark 1. Condition i) prohibits weights which are too concentrated on a few points $k$. For
instance, thanks to Lemma 1, Condition i) is trivially satisfied if

$\max_{k}|w_{j,k}| = O(N_j^{-1}).$

Theorem 2. Suppose that the weights follow Eq. (7) and that
i) there exist a constant $\kappa_1 > 1$ and a sequence $\tau_j$ in $\mathbb R_+$ such that

\2 



(l + '^j) = O('B-'^i^) 



ii) there exists a constant $\kappa_2\in\mathbb R$ such that






iii) the thresholds $t_j$ defining the set of kept coefficients are such that

$t_j = o(B^{-\alpha j})$;

iv) there exists a constant $c > 0$ such that for all $j\in\mathbb N$

$c^{-1}C^{(j)}\le\bar C^{(j)}\le c\,C^{(j)}.$

Then, for M > max{^a±^ ; '^+°^~^\+"^^ }, the sequence $\hat C^{(j)}$ is consistent.

The hypotheses of Theorem 2 can be interpreted in the following way. Condition i) describes
a trade-off between an arbitrary upper bound on the noise variance and the number of observed
points under this bound. The noise level can be as high as wanted (in the very weak limit of
Condition ii)) in some regions, provided that it is under control at least in some other regions.
An example is given below. Conditions ii) and iii) are not restrictive, and neither is the condition
on the number $M$ of derivatives of $a$, since one is free to choose an infinitely differentiable $a$. On
the contrary, Condition iv) assumes a previous partial knowledge of the parameters $C^{(j)}$ to estimate,
which is equivalent to the knowledge of the parameter $\alpha$.
[Figure 1 panels: Frequency window functions; Needlet profile, band j=15; Needlet profile, band j=23; Needlet profile, band j=30]

Fig 1. Needlets in frequency and spatial spaces. Top: window functions $b_{j,\ell}$ as functions of frequency $\ell$ (in
blue, the three ones corresponding to profiles plotted below). Bottom: Polar plots of the function
$\theta\in[0,2\pi[\ \mapsto\sum_{\ell} b_{j,\ell}L_\ell(\cos\theta)$ for j = 15, 23 and 30, to illustrate their localization property.



Example. Suppose that there exists a (5 > such that for each scale j, data with a noise variance 
less than y^Nj are available at least within a spherical disc of radius S. More precisely, there exist 



, /3 < 1 such that for all k with d{£,j^k,£,l) < S 



^* S § and Tj : 

• and aj^k < rjC'^^\ 
Then Condition i) is satisfied.



4. Simulations 

In this section, we investigate the non-asymptotic numerical performance of the NSE in a
relatively realistic setting. This is for illustration only, as there are many free parameters in the
models (mask, noise level) that are only asymptotically constrained by the previous theory.

4.1. Settings

The constant determining the width of the bands is set to $B = 1.25$. The function $a$ of Assumption 1
is designed on $[B^{-1}, B]$ by a polynomial of degree 19 connecting 1 to 0 with $M = 9$ continuous
derivatives, i.e. $a(B^{-1}) = 1$, $a(B) = 0$ and $a^{(r)}(B^{\pm 1}) = 0$, $1\le r\le M$. Frequency window functions
$b_{j,\ell}$ and some of the associated needlets are displayed in Fig. 1.
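
A function $a$ with the stated properties can be sketched as follows. The construction uses the classical polynomial "smoothstep" of degree $2M+1 = 19$, which has $M = 9$ vanishing derivatives at both ends; that this coincides with the polynomial actually used in the simulations is an assumption, as is the linear mapping of $[B^{-1}, B]$ onto $[0,1]$. The window is then formed exactly as written in Assumption 1, and all function names are hypothetical.

```python
import math

def smoothstep(t, order):
    """Polynomial of degree 2*order+1 rising from 0 to 1 on [0, 1],
    with `order` vanishing derivatives at both end points."""
    if t <= 0.0:
        return 0.0
    if t >= 1.0:
        return 1.0
    total = sum(
        math.comb(order + n, n) * math.comb(2 * order + 1, order - n) * (-t) ** n
        for n in range(order + 1)
    )
    return t ** (order + 1) * total

def a(x, B=1.25, M=9):
    """Even function equal to 1 on [-1/B, 1/B] and to 0 outside [-B, B]."""
    x = abs(x)
    return 1.0 - smoothstep((x - 1.0 / B) / (B - 1.0 / B), M)

def window(j, ell, B=1.25, M=9):
    """b_{j,l} = b(B^-j l) with b(.) = a(./B) - a(.), as printed in Assumption 1."""
    x = ell / B ** j
    return a(x / B, B, M) - a(x, B, M)

# The sum over scales telescopes to a(l / B^(J+1)) - a(l), which equals 1
# for every frequency l with B <= l <= B^J.
partition = {ell: sum(window(j, ell) for j in range(16)) for ell in (2, 5, 10, 25)}
```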

In the analysis, the first 10 scales ($0\le j\le 9$) are not considered since the support of the
corresponding window functions is of width less than or equal to 2. These scales correspond to
frequencies $\ell\le 8$. The range of studied scales is $10\le j\le 30$.

We use the HEALPix pixelization and the associated direct and inverse spherical harmonic
transforms [5]. Needlet coefficients are computed from three experiments providing maps at respective
HEALPix resolutions nside = 128, 256 and 512 (i.e. the number of cubature points $N$ is 196 608,
786 432 and 3 145 728).

Fig 2. Simulated data. First line: $WX$. Second line: $W\sigma$. Third line: $WZ$. Fourth line: $Y$. The columns
correspond to the settings for the three experiments: A (first column) used for $10\le j\le 23$; B (second
column) used for $23\le j\le 26$; C (third column) used for $26\le j\le 30$.

The simulated data are illustrated in Fig. 2. A first "experiment" (hereafter, A), i.e. a mask and
a noise level map, is used for scales $10\le j\le 23$. The noise level is reminiscent of the level expected
from the forthcoming Planck experiment of the European Space Agency¹. Two other experiments
(hereafter, B and C), with synthetic masks and noise-level maps, are used respectively for scales
$23\le j\le 26$ and $26\le j\le 30$.

4.2. Results

The distribution of the NSE estimators for the scales $10\le j\le 30$ is estimated from 800
Monte Carlo replicates in the setting of the previous section. Figure 3 illustrates the convergence of
the sequence $\hat C^{(j)}$. The approximately normal distribution of the estimator is highlighted by the
quantile-to-quantile plots of Figure 4 and the p-values from the Anderson-Darling goodness-of-fit test.

Fig 3. Box-and-whisker plots of the NSE estimator $\hat C^{(j)}$. Regions A, B and C delimit the range of the three
experiments, from the coarsest (and wide coverage) to the finest (and small coverage) one.

Fig 4. Distribution of the standardized NSE estimator $\hat C^{(j)}$ for bands 15, 23 and 30. The p-values are obtained
from the Anderson-Darling and Cramér-von Mises tests of Gaussianity. At low frequencies, the estimator is
significantly skewed.

¹ See www.rssd.esa.int/Planck/

5. Conclusion

In this work, we have proved the mean-square consistency of a spherical spectral estimator based
on the needlet analysis of a stationary field with missing data and non-homogeneous additive
noise. Our model is relatively realistic and could be enriched to fit the requirements of particular
applications. A possible development of this work is the study of the rate of convergence, limiting
law, efficiency, etc. of those estimators.

Although this has not been stated here for brevity, it must be noticed that this method allows
easy adaptation to a context with longitudinal data, i.e. of the form

$Y_{j,k,e} = W_{j,k,e}\left[X_j(\xi_{j,k}) + Z_{j,k,e}\right]$, $j\in\mathbb N$, $k\in\{1,\dots,N_j\}$, $e\in\{1,\dots,E_j\}$,

for some sequence of integers $(E_j)_{j\in\mathbb N}$. This generalization of model (2) is of high importance for
practical applications, when for each scale $j$ multiple independent observations (indexed by $e$) of
the signal of interest are available, such as those provided by the various CMB experiments (see [5]
for details).
6. Proofs

In this section, for the sake of notational simplicity,

• the sub- and superscripts $j$ are omitted;

• for quantities $a(j)$ and $b(j)$ depending on $j$, $a\lesssim b$ means: $\exists c > 0$, $\forall j\in\mathbb N$, $a(j)\le c\,b(j)$;

• $a\asymp b$ means: $a\lesssim b$ and $b\lesssim a$.

The following fact will be used extensively in the proofs. Let $(\xi_p,\lambda_p)_{1\le p\le N}$ be a pixelization of
order $4B^{j+1}$.

= A^i ^b2(S-J£)^^^ii X Afci'B^^b2(S-J£)'B-^£ x 1 (8) 

i>0 ^ i>0 



as $\int b^2(x)\,x\,dx > 0$ and $\lambda_k\asymp N^{-1}\asymp B^{-2j}$ using (1b).
6.1. Proof of Propositions 3 and 4



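$E[\eta_k\eta_{k'}] = (\lambda_k\lambda_{k'})^{-1/2}\,E\iint X(\xi)X(\xi')\,\psi_k(\xi)\psi_{k'}(\xi')\,d\xi\,d\xi'$

$= (\lambda_k\lambda_{k'})^{-1/2}\,E\iint\sum_{\ell,\ell'\ge 0}\sum_{m=-\ell}^{\ell}\sum_{m'=-\ell'}^{\ell'} a_{\ell,m}a^*_{\ell',m'}\,Y_{\ell,m}(\xi)Y^*_{\ell',m'}(\xi')\,\psi_k(\xi)\psi_{k'}(\xi')\,d\xi\,d\xi'$

$= (\lambda_k\lambda_{k'})^{-1/2}\sum_{\ell\ge 0} C_\ell\sum_{m=-\ell}^{\ell}\int Y_{\ell,m}(\xi)\psi_k(\xi)\,d\xi\int Y^*_{\ell,m}(\xi')\psi_{k'}(\xi')\,d\xi'$

$= \sum_{\ell\ge 0} b_\ell^2\,C_\ell\,L_\ell(\xi_k\cdot\xi_{k'})$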
which proves Proposition 3. On the other hand, $\zeta_k = \sum_{\ell\ge 0}\sum_{m=-\ell}^{\ell} b_\ell\,z_{\ell,m}Y_{\ell,m}(\xi_k)$ where

$z_{\ell,m}\stackrel{\mathrm{def}}{=}\sum_{p=1}^{N}\lambda_pZ_p\,Y^*_{\ell,m}(\xi_p).$



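Thus

$\mathrm{Cov}[\zeta_k,\zeta_{k'}] = \sum_{\ell,\ell'\ge 0}\sum_{m=-\ell}^{\ell}\sum_{m'=-\ell'}^{\ell'} b_\ell b_{\ell'}\,\mathrm{Cov}[z_{\ell,m},z_{\ell',m'}]\,Y_{\ell,m}(\xi_k)Y^*_{\ell',m'}(\xi_{k'})$

$= \sum_{\ell,\ell'\ge 0}\sum_{m=-\ell}^{\ell}\sum_{m'=-\ell'}^{\ell'} b_\ell b_{\ell'}\left(\sum_{p=1}^{N}\lambda_p^2\sigma_p^2\,Y^*_{\ell,m}(\xi_p)Y_{\ell',m'}(\xi_p)\right)Y_{\ell,m}(\xi_k)Y^*_{\ell',m'}(\xi_{k'})$

$= \frac{1}{\sqrt{\lambda_k\lambda_{k'}}}\sum_{p=1}^{N}\lambda_p^2\sigma_p^2\,\psi_k(\xi_p)\psi_{k'}(\xi_p).$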

6.2. Proof of Proposition 7



|EC-C| 


1 


c 


c 




1 
c 



5^u;,{Var(7,)-n2-C} 

kex 



AT 



E iJ2\pWpXiQMip)j -E (5]ApX(ep)^fe(Cp) 

/TV AT ^ 

kex \p=i p=i y 

< § E ""^^fe ' I E ( E ^(1 - Wp)^iip)M^p)] E ( E Ap^(ep)v^fe(ep) ) 

feesc [ \p=i / \p=i / 



1/2 



/ce3C 



.P=i 



p=i 






feG3C 

In particular, if $W\equiv 1$, one can take $t = 0$ in the definition of $\mathcal K$ and the above result shows that
$\hat C$ is unbiased.



6.3. Proof of Proposition 8

Define 77fc = X]^^''^{il)k,X)^ . 
Var(C) 1 V- ^ r 2 21 2 .r-^ ^ 2r i 

— C^ — = C^ Z^ w^fcWfc/Cov[77j^,77;,,J = ^ 2^ WfeWfe'Cov [Tyfc,??^/] 



k.k'e'X k.k'eJC 



2 
7^ X! '^fc'^fc' { ^°^ fe ' ^fc' ] ^ Co'^ iVk , ?7fc' - ??fe' ] 



(J2 



k.k'eX 

1 2 



- Gov [?7fc - 77fc, fjk'] + Gov [r^fc - Vk, Vk' - Vk']}' 

—2 ^ WkWk' { Gov ^ [77fe ,f]k>]+ Gov ^ [77fc , fik' - rik' ] + Gov ^ [77^ - ?7fc , ?7fc' - %■' ] } ■ (9) 



fc,fc'G3C 



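For the first term in the r.h.s. of Eq. (9), we use the following bound, proved in [1]:

$\left|\sum_{\ell\ge 0} b^2(B^{-j}\ell)\,C_\ell\,L_\ell(\xi_k\cdot\xi_{k'})\right|\le\frac{c\,B^{j(2-\alpha)}}{(1+B^{j}d(\xi_k,\xi_{k'}))^{M}}.$ (10)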



Thus, with Lemma [Hand using that C x S^^^-") and iV x ^B^^ 

y^ WfeWfe'Gov^[7yfc,r7fc/] = ^ WfeWfc' { ^ ^^Qif (6 ■ Cfc')} 

f>0 



k,k'eoc k,k'ex e>o 



< m2j(2-a) V- \WkWk' 

fc,fc'e3C 



^\^„,fi + a3^d(a,6'))'*' 



For the other terms of the decomposition (9), one can write

Var (77,-7;,) = \-'E\S2Xp{l~Wp)X{^p)M^,yj 

(N N \ 

N 



p=i 
^ Nt (11) 



thus 



and 



y^ WfeWfc' {Gov [?7fc - 77fc, 77,./]} < E WfcVar (77, - 7;,) ^ w,/ Var (77,.) 

< cm 



y^ WfeWfe' {Gov [77, - 77fe, 77,/ - 77,/]}^ < { E WfcVar (77^ - 77^)}' 
k.k'eJC kex 



Putting these terms together establishes Proposition 8.



imsart-generic ver. 2008/01/24 file: ejs_2008_264.tex date: July 14, 2008 



Fay and Guilloux / Needlet spectral estimator on the sphere 15 

6.4. Proof of Theorems 1 and 2

We have

$\mathcal E(\hat C) = \frac{|E\hat C-C|^2}{C^2}+\frac{\mathrm{Var}(\hat C)}{C^2}.$

By Proposition 7, the bias term is bounded by



C 

and converges to 0 thanks to Condition iii).
On the other hand

Var(C) 1 ^ ^ r 2 21 2 ^^ ^ 2r 

— C^ — = C^ Z^ w;feu;fe'Cov[7fc,7fc,J = — ^^ WfcWfc.Cov [7^,7^'] 

k,k' = l k,k' = l 

= -^ ^ WfcWfc'{Cov[77fc,77fc/] +Cov[Cfe,C/c']} 
k,k'£3C 



(J2 

k,k'£OC 



- ^ T. WkWk'{Co^,^[Vk,Vk']+Co^,^[CkXk']} . (12) 

From Proposition 8,

^ Y^ WkWk'Cov^[rjk,Vk']< Yl n" 



"^ \wkWk'\ , iVi , /iVA' 



k,k'eX k,k' = l ^ vsft,iSA, y; \ / 

where $\frac{N t}{C}\lesssim B^{\alpha j}\,t\to 0$ as $j\to\infty$ with Condition iii). To establish Theorems 1 and 2, it
suffices now to prove that in both cases

\WkWt 



kM' = l 

and 



V ''"'''"fe'l , (13) 

^ V WkWk'Cov'iCkXk'] ^ . (14) 

fc,fc'63C 

Let us consider separately the cases of Theorems 1 and 2.
6.5. End of proof of Theorem 1

The convergence of Eq. (13) is nothing else but Condition i). To prove (14), use the Property (1c)
and the Cauchy-Schwarz inequality and write

I I ^ 

fe,fc'G3C k.k'eOC ^ ^' p=l 

TV 

k,k'£OC p=l 

JV Af 

< E i^'o^'o'i{E^p^>p}{E^pi'/'fc(^>'='(^p)i'} 

k,k'£JC p=l p=l 

TV 

^ lE^p^'^p} E i^fc^'^'i 



,2^^ 1 



p=l k.k'eX 

N 

o{C) 



3-*oo 

with Conditions m and 112) and Proposition [5l 

6.6. End of proof of Theorem 2

The two following remarks will prove useful. First, from Condition iv),
$(c^{-1}\wedge 1)(C+n_k^2)\le(\bar C+n_k^2)\le(c\vee 1)(C+n_k^2)$ so that

$w_k\lesssim\frac{1}{S}(C+n_k^2)^{-2}\,\mathbb 1_{k\in\mathcal K}.$ (15)

Second, using again Condition iv),

$S\gtrsim\frac{\mathrm{Card}\,\mathcal V}{(1+\tau)^2C^2}$ (16)

where

$\mathcal V\stackrel{\mathrm{def}}{=}\{k\in\mathcal K : n_k^2\le\tau C\}.$ (17)

The convergence of Eq. (13) is established thanks to Conditions i) and ii) and Lemma 1:



Af 



(i + SJd(a,6'))'^^ '^ ^2 2^ (1 

C-^ XT- 1 



^ (1 + 13J-d(a,6'))'^' ^' ,^3, (1 + S^d(a,6'))'*' 



< 



S.TT 



S' ^,f;?^-(i + «'<'(&,&-))"' 



For the convergence of Eq. (14), note that $\mathrm{Cov}[\zeta_k,\zeta_{k'}]\asymp\sum_{p=1}^{N}\lambda_pW_p^2\sigma_p^2\,\psi_k(\xi_p)\psi_{k'}(\xi_p)$ where
the implicit constant does not depend on $k, k'$ (Proposition 4 and property (1c) of the pixelization).
For any constant $\kappa\in(0,1)$, the value of which will be fixed later, we have



k,k'GJC ' k,k'eX 

N 



with 



x{T.>'p^yp^'^(^p^^'''^^p^y 



< ^^(^1+^2+^3+^4) 



N 



Ai = J2 iC + nl)-'{C + nl)-^Y.^P^p''>'^(^p)^'^'(^p)y 



k.k'ex p=l 

A2 ^ Y. iC + nlr\C + nl)-'{ E \W^ 4M^p)^k' {ip)V 

fc,fc'ex pGD{k,k') 

where $D(k,k') = \{p : d(\xi_k,\xi_p) > \frac12B^{-\kappa j}\ \text{and}\ d(\xi_{k'},\xi_p) > \frac12B^{-\kappa j}\}$
As ^ E iC + nl)-HC + nl,)-'{nl E KMip)^k'{QV 

k,k'e5c p:rf(5fc,Cp)<i3-»J 

d(«fc,«fc')>23-''J 

Remark. As noticed in Section 3.1, the weights proposed here are linked with the asymptotic
uncorrelation of the needlet coefficients. $A_1$ includes the near-diagonal terms of the noise covariance
in the needlet coefficients domain; $A_2$, $A_3$ and $A_4$ include the off-diagonal covariances, for which
the localization of the needlets is crucial. In $A_2$, the "near-disjointness" of the supports of $\psi_k$ and
$\psi_{k'}$ is only considered sufficiently far away from $\xi_k$ and $\xi_{k'}$, while in $A_3$ it is considered around $\xi_k$
(or equivalently $\xi_{k'}$). Finally, $A_4$ depends on the regularity of $p\mapsto\sigma_p^2$.

We shall use the fact that: Vx, C G R+, iq^I^-m < niin \^ ^ ; jr ? • 

With the Cauchy-Schwarz inequality, $W \le 1$ and the properties of the cubature points (including
Lemma 2), we have

Ai < E iC + nl)-\C + nl)-^Y.Xpal^l:l{^p)){JZ\^li^im 

fc.fc'eiK p=l p=l 

^ {C + nfy ^ {C + nlY 

keX ^ k' fc'esc ^ f' ' 



rf(?fc.«fc')<S- 






Thanks to Eq. (fT6|) and Condition [U 



^4 



^^ < A,f2 (!+'") m-2K7 _ m2j(2--Ki-K) ^ q 

C252 - (CardV^)2 '^ j^oo 

provided that 

$\kappa > 2 - \kappa_1$ . (18a)
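
The exponent $2 - \kappa_1 - \kappa$ in the bound for $A_1$ can be checked by simple bookkeeping. This is only a sketch under two assumptions that are not restated from the text: $N_j \asymp B^{2j}$ for the number of cubature points at scale $j$ (with $B > 1$), and a lower bound of the form $\mathrm{Card}\,\mathcal{V} \gtrsim B^{\kappa_1 j}$.

```latex
% Assumptions of this sketch: N_j \asymp B^{2j}, Card V \gtrsim B^{\kappa_1 j}, B > 1.
\[
\frac{N_j^2\, B^{-2\kappa j}}{(\mathrm{Card}\,\mathcal{V})^2}
\;\lesssim\; \frac{B^{4j}\, B^{-2\kappa j}}{B^{2\kappa_1 j}}
\;=\; B^{2j(2-\kappa_1-\kappa)} .
\]
% The right-hand side tends to 0 as j \to \infty exactly when
% 2 - \kappa_1 - \kappa < 0, that is, \kappa > 2 - \kappa_1.
```

Under these assumptions the factor $N_j^2 B^{-2\kappa j}$ is the order of the number of pairs $(k,k')$ whose centers lie within a distance of order $B^{-\kappa j}$ of each other.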
To give upper bounds on $A_2$ and $A_3$, we use Proposition 5. Define

$D(k,k') \stackrel{\mathrm{def}}{=} \{p : d(\xi_k,\xi_p) > \tfrac12 B^{-\kappa j} \text{ and } d(\xi_{k'},\xi_p) > \tfrac12 B^{-\kappa j}\}$ .



Write 



A, < J2 iC + nl)-'{C + nl)- 



k.k'eJC 
X • 



peD{k.k'} 

< Y. ic+nir^ic+nir' 

k,k'&X 

pern.') i^ + '^'d{^„i,))^^{l + 'B^d{^„i,,)V'^ 

fe,fe'e3C V ^ 2 '^ p=i 

\ 2 



so that, with Eq. (fT6|) and Condition [nl), 

^25-2 -- yZ^p=i P Pj j^oo 

provided that 

A/>^i±^ . (18b) 

1 — K 

Remark now that if $d(\xi_k,\xi_{k'}) > 2B^{-\kappa j}$ and $d(\xi_p,\xi_k) \le \tfrac12 B^{-\kappa j}$ then $d(\xi_p,\xi_{k'}) \ge \tfrac12 d(\xi_k,\xi_{k'})$ so that

1 > 1 Tlnne 

^3 < E (C + n2)2"(^ + ,2)2 E A,|^.(Cp)^.(Cp)r 



A, 

(C + n2)2(C + n2„)2^ _ ^^,^_^^^. (l + SJd(a,6'))'*' 



^ (C + n2)2(C + n2)2 ^^ 



C2 

and with Condition |i]) 



< \^ !!fe m4:jn,~j{l-K.)2Mn,-2iij 

k.k'eX ^ ' kJ \ ' k' J 

Ar2 

< ^2.7(2-(1-k)M) 



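$\frac{A_3}{C^2 S^2} \lesssim \frac{N_j^2}{(\mathrm{Card}\,\mathcal{V})^2}\, B^{2j(2-(1-\kappa)M)} \lesssim B^{2j(4-\kappa_1-(1-\kappa)M)} \xrightarrow[j\to\infty]{} 0$

provided that

$M > \frac{4-\kappa_1}{1-\kappa}$ . (18c)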
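
The geometric remark used for $A_3$ is a direct consequence of the triangle inequality. The short derivation below is a sketch; the separation threshold $2B^{-\kappa j}$ and the radius $\tfrac12 B^{-\kappa j}$ are assumptions about the constants.

```latex
% Assumed: d(\xi_k,\xi_{k'}) > 2B^{-\kappa j} and d(\xi_p,\xi_k) \le \tfrac12 B^{-\kappa j}.
\begin{align*}
d(\xi_p,\xi_{k'})
  &\ge d(\xi_k,\xi_{k'}) - d(\xi_p,\xi_k) \\
  &\ge d(\xi_k,\xi_{k'}) - \tfrac12 B^{-\kappa j} \\
  &>   d(\xi_k,\xi_{k'}) - \tfrac14\, d(\xi_k,\xi_{k'})
   =   \tfrac34\, d(\xi_k,\xi_{k'}) .
\end{align*}
% Hence 1 + B^j d(\xi_p,\xi_{k'}) \ge \tfrac34 (1 + B^j d(\xi_k,\xi_{k'})), so that
% (1 + B^j d(\xi_p,\xi_{k'}))^{-M} \le (4/3)^M (1 + B^j d(\xi_k,\xi_{k'}))^{-M}.
```

If the needlets satisfy a localization bound of the usual form $|\psi_{k'}(\xi)| \lesssim B^j (1 + B^j d(\xi,\xi_{k'}))^{-M}$ (assumed here), the last line lets the distance from $\xi_p$ to $\xi_{k'}$ be replaced by the distance between the two centers, at the cost of a constant depending only on $M$.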

For the last term $A_4$, we use Condition [u]) , the same arguments as for $A_3$, and Lemma 9, which
takes advantage of the localization of $\psi_k$ around the point $\xi_k$ to compare the local noise level in the direct
and needlet domains. The estimate depends on the regularity g oi u^ . 
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l<fc<fc'<Afj 

that for all i G N, e > 0, and k,k' € {1,. . . , Nj} with d{S,k,^k') < e 



Lemma 9. Define a,- = max - — '''''^^ti — , j,fc'""j ■'''-' ^ There exist a constant c > such 



l(MG-,fc"T„fcO' - Kfc)'l <cUnj,kf (l - STW^) ' + g^ (e + Stw^) 



The proof of Lemma 9 is postponed to the end of the Section. For



M > ii^ (18d) 

1 — K 



and using Lemma [H] with e = -jS '^^ we get 



l-M 



Ai < l-ST+iuJ A 



-2 



Vf-23-«^+15^^Y V ^P:dii.,i.}<i^-^^^P\^^=^^P^^'^'(^P^\ 



2 ' J ^ {C + nly{C + nl,y 

as ytIj < — k. Note now that under [il| and using px| and pe)) . we have 

0('3('^2+3)A 









and 
provid 


ed that 


^4 ^ ^3 ^2j( 
^2^2 - ^2^2 ^ 



I rg2i{2-Ki+K2+3 + a-(l-K)A/) ^ Q 

Af>^ + "'"^ + "^ . (18e) 

1 — K 

It remains to see that there exists a $\kappa \in (0,1)$ satisfying simultaneously the conditions (18a)

to (HHel). This is the case if M > max |^^ ; ^^ ; ^+"^'^;i\+'^" |. It suffices to take k e (2-ki, 1) 

sufficiently near to $2 - \kappa_1$ (recall that $\kappa_1 > 1$). Finally, remark that any $\kappa_1$ satisfying the corresponding Condition is bounded by
2, so the above condition on $M$ reduces to the one mentioned in the statement of the Theorem.
This concludes the proof of (14) and Theorem 2.
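
To make the choice of $\kappa$ concrete, here is a minimal numerical sketch. It checks only two constraints: $\kappa > 2 - \kappa_1$ from (18a), and $M > (4-\kappa_1)/(1-\kappa)$, which is the form of (18c) assumed here. The other conditions have the same shape and could be treated the same way. The function name `pick_kappa` is hypothetical.

```python
from typing import Optional


def pick_kappa(kappa1: float, M: float) -> Optional[float]:
    """Return some kappa in (2 - kappa1, 1) with M > (4 - kappa1) / (1 - kappa).

    Assumes 1 < kappa1 <= 2 and M > 0. Returns None when no such kappa exists.
    Only (18a) and the assumed form of (18c) are checked in this sketch.
    """
    if not (1.0 < kappa1 <= 2.0) or M <= 0:
        raise ValueError("expected 1 < kappa1 <= 2 and M > 0")
    lower = 2.0 - kappa1               # (18a): kappa > 2 - kappa1
    upper = 1.0 - (4.0 - kappa1) / M   # (18c) rearranged: kappa < 1 - (4 - kappa1) / M
    if upper <= lower:
        return None                    # the open interval (lower, upper) is empty
    return 0.5 * (lower + upper)       # midpoint of the admissible interval


if __name__ == "__main__":
    print(pick_kappa(1.5, 6.0))
    print(pick_kappa(1.5, 5.0))
```

For these two constraints the admissible interval is non-empty exactly when $M > (4-\kappa_1)/(\kappa_1-1)$, which is the value obtained by letting $\kappa$ decrease to $2-\kappa_1$.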
Proof of Lemma 9

First, for any (5 > 0, using ^i, ([Te]), ^,Q<W <l and the definition of g 






\wM-<\ 



^f' p=i 
1 ^ 1 " 

p—1 p—1 

P-d.{ip.ik)<S p:rf(?p,?(=)><5 



< eS + 



w 



(l + SJ5)^^ 



Y \p\Wlal-Wy\+Wla^ 



k 



< e[S + 



■BJ 



Second 



N 



(1 + SJ(5)*=f 



JV 



p--d{ip.ik)>S 



wM 



(19) 



Af 



Yxpwyy^ii,) = M/.v^Ev^'(^p)-Ev(w^^^-w^pV^)V'^(c,) 



p=i 



p=i 



p=i 



AT 



p=l p:rf(?p,Cfc)<'5 p:rf(?p,«fc)>'5 

W \ ( SJ 



> Wlal ( 1 - 



-eU 



(1 + 3j5)^^, 
Combining Eqs. (19) and (20), one gets, for any $k'$ with $d(\xi_k,\xi_{k'}) \le \varepsilon$

■BJ 



(20) 



< £»e + e U + 



+ [ni + g[5 + 



(1 + SJ(5)^f 



nil I 



W 



(1 + SJ(5)^^ 
-1 



1- 



SJ 



(1 + SJ(5)*^ 



(1 + SJ'5)^^ 



(1 + -Bjsy^ 



1+1- 



SJ 



(1 + "BUy^ 



1 ~ IVl ■ 

Particularizing this last expression with (5 = ® i+mj leads to Lemma |91 since we have, for M > 3, 
j > 1 and this 5: 



< 



(l + 'B^S)"'' ^ 2 
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